Robust Audio-Visual Person Verification Using Web-Camera Video

Authors

Daniel Schultz

Timothy J. Hazen

James R. Glass

Arthur C. Smith

Abstract

This thesis examines the challenge of robust audio-visual person verification using data recorded in multiple environments with various lighting conditions, irregular visual backgrounds, and diverse background noise. Audio-visual person verification could prove to be very useful in both physical and logical access control security applications, but only if it can perform well in a variety of environments. This thesis first examines the factors that affect video-only person verification performance, including recording environment, amount of training data, and type of facial feature used. We then combine scores from audio and video verification systems to create a multi-modal verification system and compare its accuracy with that of either single-mode system.

Thesis Supervisor: Timothy J. Hazen
Title: Research Scientist, Computer Science and Artificial Intelligence Laboratory

Thesis Supervisor: James R. Glass
Title: Principal Research Scientist, Computer Science and Artificial Intelligence Laboratory
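The abstract does not say how the audio and video scores are combined. As a rough illustration only, the sketch below assumes a common baseline: each system's raw score is z-normalized with statistics estimated on held-out data, and the two normalized scores are merged with a weighted sum before thresholding. The function names, the weight, and the threshold are all hypothetical and are not taken from the thesis.

```python
def zscore(score: float, mean: float, std: float) -> float:
    """Normalize a raw score with a mean and std estimated on held-out data."""
    if std <= 0:
        raise ValueError("std must be positive")
    return (score - mean) / std


def fuse_scores(audio: float, video: float, audio_weight: float = 0.5) -> float:
    """Weighted-sum fusion of two normalized scores (assumed scheme)."""
    if not 0.0 <= audio_weight <= 1.0:
        raise ValueError("audio_weight must lie in [0, 1]")
    return audio_weight * audio + (1.0 - audio_weight) * video


def verify(fused_score: float, threshold: float = 0.0) -> bool:
    """Accept the identity claim when the fused score reaches the threshold."""
    return fused_score >= threshold
```

In a setup like this, the weight and the threshold would normally be tuned on development data, which is one way to compare the fused system against either single-mode system.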

Similar resources

Robust face-voice based speaker identity verification using multilevel fusion

In this paper, we propose a robust multilevel fusion strategy involving cascaded multimodal fusion of audio–lip–face motion, correlation and depth features for biometric person authentication. The proposed approach combines the information from different audio–video based modules, namely: audio–lip motion module, audio–lip correlation module, 2D + 3D motion-depth fusion module, and performs a h...

UCBN: A new audio-visual broadcast news corpus for multimodal speaker verification studies

The performance of face, voice, and multimodal speaker verification systems in complex and non-controlled scenarios is typically lower than that of systems developed in highly controlled environments. With the aim of facilitating the development of robust multi-modal speaker recognition systems, a new multi-modal (audio-visual) Australian broadcast UCBN (University of Canberra Broadcast News) corpus was...

Face Video Competition

Person recognition using facial features, e.g., mug-shot images, has long been used in identity documents. However, due to the widespread use of web-cams and mobile devices embedded with a camera, it is now possible to realise facial video recognition, rather than resorting to just still images. In fact, facial video recognition offers many advantages over still image recognition; these include...

Robust person verification based on speech and facial images

This paper describes a multi-modal person verification system using speech and frontal face images. We consider two different speaker verification algorithms, a text-independent method using a second-order statistical measure and a text-dependent method based on hidden Markov modelling, as well as a face verification technique using a robust form of correlation. Fusion of the different recognit...

Audio-visual interaction in multimedia communication

To many people, the word “multimedia” simply means the combination of various forms of information: text, speech, music, images, graphics and video. What is often overlooked is the interaction among these forms. In this paper, we will present our recent results in exploiting the audio-visual interaction that is very significant in multimedia communication. The applications include lip synchroni...

Publication date: 2006
